55 research outputs found

    Gene Network Reconstruction using Global-Local Shrinkage Priors

    Get PDF
    Reconstructing a gene network from high-throughput molecular data is an important but challenging task, as the number of parameters to estimate easily is much larger than the sample size. A conventional remedy is to regularize or penalize the model likelihood. In network models, this is often done locally\textit{locally} in the neighborhood of each node or gene. However, estimation of the many regularization parameters is often difficult and can result in large statistical uncertainties. In this paper we propose to combine local regularization with global\textit{global} shrinkage of the regularization parameters to borrow strength between genes and improve inference. We employ a simple Bayesian model with nonsparse, conjugate priors to facilitate the use of fast variational approximations to posteriors. We discuss empirical Bayes estimation of hyperparameters of the priors, and propose a novel approach to rank-based posterior thresholding. Using extensive model- and data-based simulations, we demonstrate that the proposed inference strategy outperforms popular (sparse) methods, yields more stable edges, and is more reproducible. The proposed method, termed ShrinkNet\texttt{ShrinkNet}, is then applied to Glioblastoma to investigate the interactions between genes associated with patient survival.This work was supported by the Center for Medical Systems Biology (CMSB), and the European Union Grant EpiRadBio, established by the Netherlands Genomics Initiative/Netherlands Organization for Scientific Research (NGI/NWO), nr. FP7-269553

    On dynamic network entropy in cancer

    Get PDF
    The cellular phenotype is described by a complex network of molecular interactions. Elucidating network properties that distinguish disease from the healthy cellular state is therefore of critical importance for gaining systems-level insights into disease mechanisms and ultimately for developing improved therapies. By integrating gene expression data with a protein interaction network to induce a stochastic dynamics on the network, we here demonstrate that cancer cells are characterised by an increase in the dynamic network entropy, compared to cells of normal physiology. Using a fundamental relation between the macroscopic resilience of a dynamical system and the uncertainty (entropy) in the underlying microscopic processes, we argue that cancer cells will be more robust to random gene perturbations. In addition, we formally demonstrate that gene expression differences between normal and cancer tissue are anticorrelated with local dynamic entropy changes, thus providing a systemic link between gene expression changes at the nodes and their local network dynamics. In particular, we also find that genes which drive cell-proliferation in cancer cells and which often encode oncogenes are associated with reductions in the dynamic network entropy. In summary, our results support the view that the observed increased robustness of cancer cells to perturbation and therapy may be due to an increase in the dynamic network entropy that allows cells to adapt to the new cellular stresses. Conversely, genes that exhibit local flux entropy decreases in cancer may render cancer cells more susceptible to targeted intervention and may therefore represent promising drug targets.Comment: 10 pages, 3 figures, 4 tables. Submitte

    Genomic aberrations relate early and advanced stage ovarian cancer

    Get PDF
    Background Because of the distinct clinical presentation of early and advanced stage ovarian cancer, we aim to clarify whether these disease entities are solely separated by time of diagnosis or whether they arise from distinct molecular events. Methods Sixteen early and sixteen advanced stage ovarian carcinomas, matched for histological subtype and differentiation grade, were included. Genomic aberrations were compared for each early and advanced stage ovarian cancer by array comparative genomic hybridization. To study how the aberrations correlate to the clinical characteristics of the tumors we clustered tumors based on the genomic aberrations. Results The genomic aberration patterns in advanced stage cancer equalled those in early stage, but were more frequent in advanced stage (p=0.012). Unsupervised clustering based on genomic aberrations yielded two clusters that significantly discriminated early from advanced stage (p= 0.001), and that did differ significantly in survival (p= 0.002). These clusters however did give a more accurate prognosis than histological subtype or differentiation grade. Conclusion This study indicates that advanced stage ovarian cancer either progresses from early stage or from a common precursor lesion but that they do not arise from distinct carcinogenic molecular events. Furthermore, we show that array comparative genomic hybridization has the potential to identify clinically distinct patients

    4H Leukodystrophy: A Brain Magnetic Resonance Imaging Scoring System

    Get PDF
    4H (hypomyelination, hypodontia and hypogonadotropic hypogonadism) leukodystrophy (4H) is an autosomal recessive hypomyelinating white matter (WM) disorder with neurologic, dental, and endocrine abnormalities. The aim of this study was to develop and validate a magnetic resonance imaging (MRI) scoring system for 4H. A scoring system (0-54) was developed to quantify hypomyelination and atrophy of different brain regions. Pons diameter and bicaudate ratio were included as measures of cerebral and brainstem atrophy, and reference values were determined using controls. Five independent raters completed the scoring system in 40 brain MRI scans collected from 36 patients with genetically proven 4H. Interrater reliability (IRR) and correlations between MRI scores, age, gross motor function, gender, and mutated gene were assessed. IRR for total MRI severity was found to be excellent (intraclass correlation coefficient: 0.87; 95% confidence interval: 0.80-0.92) but varied between different items with some (e.g., myelination of the cerebellar WM) showing poor IRR. Atrophy increased with age in contrast to hypomyelination scores. MRI scores (global, hypomyelination, and atrophy scores) significantly correlated with clinical handicap (p < 0.01 for all three items) and differed between the different genotypes. Our 4H MRI scoring system reliably quantifies hypomyelination and atrophy in patients with 4H, and MRI scores reflect clinical disease severity

    Linear and non-linear dependencies between copy number aberrations and mRNA expression reveal distinct molecular pathways in breast cancer

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Elucidating the exact relationship between gene copy number and expression would enable identification of regulatory mechanisms of abnormal gene expression and biological pathways of regulation. Most current approaches either depend on linear correlation or on nonparametric tests of association that are insensitive to the exact shape of the relationship. Based on knowledge of enzyme kinetics and gene regulation, we would expect the functional shape of the relationship to be gene dependent and to be related to the gene regulatory mechanisms involved. Here, we propose a statistical approach to investigate and distinguish between linear and nonlinear dependences between DNA copy number alteration and mRNA expression.</p> <p>Results</p> <p>We applied the proposed method to DNA copy numbers derived from Illumina 109 K SNP-CGH arrays (using the log R values) and expression data from Agilent 44 K mRNA arrays, focusing on commonly aberrated genomic loci in a collection of 102 breast tumors. Regression analysis was used to identify the type of relationship (linear or nonlinear), and subsequent pathway analysis revealed that genes displaying a linear relationship were overall associated with substantially different biological processes than genes displaying a nonlinear relationship. In the group of genes with a linear relationship, we found significant association to canonical pathways, including purine and pyrimidine metabolism (for both deletions and amplifications) as well as estrogen metabolism (linear amplification) and BRCA-related response to damage (linear deletion). In the group of genes displaying a nonlinear relationship, the top canonical pathways were specific pathways like PTEN and PI13K/AKT (nonlinear amplification) and Wnt(B) and IL-2 signalling (nonlinear deletion). Both amplifications and deletions pointed to the same affected pathways and identified cancer as the top significant disease and cell cycle, cell signaling and cellular development as significant networks.</p> <p>Conclusions</p> <p>This paper presents a novel approach to assessing the validity of the dependence of expression data on copy number data, and this approach may help in identifying the drivers of carcinogenesis.</p

    Incorporating gene co-expression network in identification of cancer prognosis markers

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Extensive biomedical studies have shown that clinical and environmental risk factors may not have sufficient predictive power for cancer prognosis. The development of high-throughput profiling technologies makes it possible to survey the whole genome and search for genomic markers with predictive power. Many existing studies assume the interchangeability of gene effects and ignore the coordination among them.</p> <p>Results</p> <p>We adopt the weighted co-expression network to describe the interplay among genes. Although there are several different ways of defining gene networks, the weighted co-expression network may be preferred because of its computational simplicity, satisfactory empirical performance, and because it does not demand additional biological experiments. For cancer prognosis studies with gene expression measurements, we propose a new marker selection method that can properly incorporate the network connectivity of genes. We analyze six prognosis studies on breast cancer and lymphoma. We find that the proposed approach can identify genes that are significantly different from those using alternatives. We search published literature and find that genes identified using the proposed approach are biologically meaningful. In addition, they have better prediction performance and reproducibility than genes identified using alternatives.</p> <p>Conclusions</p> <p>The network contains important information on the functionality of genes. Incorporating the network structure can improve cancer marker identification.</p

    Expression Signature of IFN/STAT1 Signaling Genes Predicts Poor Survival Outcome in Glioblastoma Multiforme in a Subtype-Specific Manner

    Get PDF
    Previous reports have implicated an induction of genes in IFN/STAT1 (Interferon/STAT1) signaling in radiation resistant and prosurvival tumor phenotypes in a number of cancer cell lines, and we have hypothesized that upregulation of these genes may be predictive of poor survival outcome and/or treatment response in Glioblastoma Multiforme (GBM) patients. We have developed a list of 8 genes related to IFN/STAT1 that we hypothesize to be predictive of poor survival in GBM patients. Our working hypothesis that over-expression of this gene signature predicts poor survival outcome in GBM patients was confirmed, and in addition, it was demonstrated that the survival model was highly subtype-dependent, with strong dependence in the Proneural subtype and no detected dependence in the Classical and Mesenchymal subtypes. We developed a specific multi-gene survival model for the Proneural subtype in the TCGA (the Cancer Genome Atlas) discovery set which we have validated in the TCGA validation set. In addition, we have performed network analysis in the form of Bayesian Network discovery and Ingenuity Pathway Analysis to further dissect the underlying biology of this gene signature in the etiology of GBM. We theorize that the strong predictive value of the IFN/STAT1 gene signature in the Proneural subtype may be due to chemotherapy and/or radiation resistance induced through prolonged constitutive signaling of these genes during the course of the illness. The results of this study have implications both for better prediction models for survival outcome in GBM and for improved understanding of the underlying subtype-specific molecular mechanisms for GBM tumor progression and treatment response

    Accounting for uncertainty when assessing association between copy number and disease: a latent class model

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Copy number variations (CNVs) may play an important role in disease risk by altering dosage of genes and other regulatory elements, which may have functional and, ultimately, phenotypic consequences. Therefore, determining whether a CNV is associated or not with a given disease might be relevant in understanding the genesis and progression of human diseases. Current stage technology give CNV probe signal from which copy number status is inferred. Incorporating uncertainty of CNV calling in the statistical analysis is therefore a highly important aspect. In this paper, we present a framework for assessing association between CNVs and disease in case-control studies where uncertainty is taken into account. We also indicate how to use the model to analyze continuous traits and adjust for confounding covariates.</p> <p>Results</p> <p>Through simulation studies, we show that our method outperforms other simple methods based on inferring the underlying CNV and assessing association using regular tests that do not propagate call uncertainty. We apply the method to a real data set in a controlled MLPA experiment showing good results. The methodology is also extended to illustrate how to analyze aCGH data.</p> <p>Conclusion</p> <p>We demonstrate that our method is robust and achieves maximal theoretical power since it accommodates uncertainty when copy number status are inferred. We have made <monospace>R</monospace> functions freely available.</p

    Genomic profiling identifies common HPV-associated chromosomal alterations in squamous cell carcinomas of cervix and head and neck

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>It is well known that a persistent infection with high-risk human papillomavirus (hrHPV) is causally involved in the development of squamous cell carcinomas of the uterine cervix (CxSCCs) and a subset of SCCs of the head and neck (HNSCCs). The latter differ from hrHPV-negative HNSCCs at the clinical and molecular level.</p> <p>Methods</p> <p>To determine whether hrHPV-associated SCCs arising from different organs have specific chromosomal alterations in common, we compared genome-wide chromosomal profiles of 10 CxSCCs (all hrHPV-positive) with 12 hrHPV-positive HNSCCs and 30 hrHPV-negative HNSCCs. Potential organ-specific alterations and alterations shared by SCCs in general were investigated as well.</p> <p>Results</p> <p>Unsupervised hierarchical clustering resulted in one mainly hrHPV-positive and one mainly hrHPV-negative cluster. Interestingly, loss at 13q and gain at 20q were frequent in HPV-positive carcinomas of both origins, but uncommon in hrHPV-negative HNSCCs, indicating that these alterations are associated with hrHPV-mediated carcinogenesis. Within the group of hrHPV-positive carcinomas, HNSCCs more frequently showed gains of multiple regions at 8q whereas CxSCCs more often showed loss at 17p. Finally, gains at 3q24-29 and losses at 11q22.3-25 were frequent (>50%) in all sample groups.</p> <p>Conclusion</p> <p>In this study hrHPV-specific, organ-specific, and pan-SCC chromosomal alterations were identified. The existence of hrHPV-specific alterations in SCCs of different anatomical origin, suggests that these alterations are crucial for hrHPV-mediated carcinogenesis.</p

    SignS: a parallelized, open-source, freely available, web-based tool for gene selection and molecular signatures for survival and censored data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Censored data are increasingly common in many microarray studies that attempt to relate gene expression to patient survival. Several new methods have been proposed in the last two years. Most of these methods, however, are not available to biomedical researchers, leading to many re-implementations from scratch of ad-hoc, and suboptimal, approaches with survival data.</p> <p>Results</p> <p>We have developed SignS (Signatures for Survival data), an open-source, freely-available, web-based tool and R package for gene selection, building molecular signatures, and prediction with survival data. SignS implements four methods which, according to existing reviews, perform well and, by being of a very different nature, offer complementary approaches. We use parallel computing via MPI, leading to large decreases in user waiting time. Cross-validation is used to asses predictive performance and stability of solutions, the latter an issue of increasing concern given that there are often several solutions with similar predictive performance. Biological interpretation of results is enhanced because genes and signatures in models can be sent to other freely-available on-line tools for examination of PubMed references, GO terms, and KEGG and Reactome pathways of selected genes.</p> <p>Conclusion</p> <p>SignS is the first web-based tool for survival analysis of expression data, and one of the very few with biomedical researchers as target users. SignS is also one of the few bioinformatics web-based applications to extensively use parallelization, including fault tolerance and crash recovery. Because of its combination of methods implemented, usage of parallel computing, code availability, and links to additional data bases, SignS is a unique tool, and will be of immediate relevance to biomedical researchers, biostatisticians and bioinformaticians.</p
    corecore